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INTRODUCTION 


Transcriptional  activators  and  repressors  are  often  involved  in  cell  cycle  control  and  are  altered 
in  breast  cancer \  Consequently,  this  structural  biology  project  focuses  on  the  following  proteins 
involved  in  cell  cycle  regulation:  the  retinoblastoma  tumor  suppressor  protein  (pRB),  DNA  viral 
oncoproteins  HPV  E7  and  Adenovirus  El  A,  the  p300/CBP-associated  factor  (PCAF)  and  p53.  pRB  is 
an  example  of  a  transcriptional  repressor  that  is  critically  involved  in  the  control  of  the  Gl-S  phase 
transition  of  the  cell  cycle^.  In  cyclin  Dl-  mediated  breast  cancer,  overexpressed  cyclin  D1  binds  to  and 
inactivates  pRB  through  phosphorylation,  which  promotes  uncontrolled  cell  proliferation^.  Since  DNA 
viral  oncoproteins  and  the  cellular  cyclin  D  protein  share  homologous  regions  that  are  essential  for 
interaction  with  pRB,  we  hypothesize  that  these  viral  oncoproteins  compete  with  and  possibly  imitate 
some  interactions  that  pRB  normally  has  with  cyclin  D  and  other  cellular  proteins.  Therefore,  a  primary 
goal  of  this  project  is  to  perform  structural  studies  of  the  pRB  tumor  suppressor  complexed  with  viral 
oncoproteins  HPV  E7  and  Adenovirus  El  A  in  order  to  gain  insight  into  pRB  function  and  high  affinity 
pRB-protein  interaction.  Although  structural  information  is  already  available  for  the  pRB  small  pocket 
domain  bound  to  a  nine  amino  acid  HPV  16  E7  peptide  (amino  acids  20-29/,  the  crystallized  E7  peptide 
is  incapable  of  inactivating  pRB  and  binds  to  pRB  with  a  twenty-fold  weaker  affinity  compared  with 
full-length  HPV  16  E7^.  Consequently,  the  constructs  of  HPV  E7  and  Adenovirus  ElA  utilized  in  this 
project  include  the  additional  pRB  inactivating  regions^’^.  A  second  focus  of  this  project  is  to  elucidate 
the  mechanism  of  human  PCAF-mediated  p53  activation  using  structural  biology.  p53  is  a 
transcriptional  activator  that  is  also  involved  in  the  control  of  the  Gl-S  phase  transition  of  the  cell 
cycle*.  p53  functions  as  a  tumor  suppressor  that  is  often  mutated  in  breast  cancer^.  Risk  of  breast 
cancer  recurrence  and  breast  cancer  related  death  is  increased  by  at  least  50%  if  p53  is  abnormal^. 
Human  PCAF  mediates  transcriptional  activation  through  its  ability  to  acetylate  nucleosomal  histone 
substrates  as  well  as  transcriptional  activators  such  as  the  p53  tumor  suppressor*®’**.  Specifically,  PCAF 
acetylates  lysine  320  of  p53  in  vitro,  resulting  in  an  increased  affinity  of  p53  to  DNA**^  Correlatively, 
lysine  320  of  p53  is  acetylated  in  vivo  in  response  to  DNA  damage**.  PCAF  is  also  inhibited  by  the 
Adenovirus  ElA  oncoprotein,  which  leads  to  the  suppression  of  PCAF  mediated  transactivation*^. 

Since  PCAF  is  targeted  by  a  viral  oncoprotein  and  modulates  p53  tumor  suppressor  activity,  the  second 
goal  of  this  project  is  to  determine  the  structure  of  the  PCAF  acetyltransferase  domain  with  coenzyme  A 
and  a  p53-derived  peptide  in  order  to  gain  insight  into  p53  activation. 
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BODY 


pRB-viral  oncoprotein  studies 

The  cDNA  of  HPV16  E7  and  HPVla  E7  was  obtained  from  Dr.  Robert  Ricciardi  and  Dr. 
Thomas  Iftner,  respectively.  Three  constructs  of  HPV16E7  and  of  HPVla  E7  were  subcloned  into  a 
pRSETA  vector  for  protein  expression  with  a  T7  promoter-T7  polymerase  expression  system  in  the 
bacterial  strain  BL21(DE3).  Constructs  of  the  full  length  HPV16  E7  (amino  acids  1-98)  and  frill 
length  HPVla  E7  (amino  acids  1-93)  were  produced  and  include  three  highly  conserved  regions 
(CR1-CR3,  Figurela)  among  DNA  viral  oncoproteins  HPVE7,  Adenovirus  ElA  and  SV40  large  T 
antigen.  Constructs  containing  the  minimal  pRB  binding  domains  (CR2-CR3,  Figure  la.  Construct 
2)  were  also  generated  for  HPV16  E7  (amino  acids  17-98)  and  HPVla  E7  (amino  acids  16-93). 
Smaller  constructs  that  included  only  the  pRB-inactivating  region  (CR3,  Figure  la,  Construct  3) 
were  also  generated  for  HPV16  E7  (amino  acids  38-98)  and  HPVla  E7  (  amino  acids  39-93).  All 
HPV16  E7  proteins,  HPVla  E7(l-93)  and  HPVla  E7  (16-93)  constructs  express  soluble  proteins  at 
37°^C  and  are  purified  to  homogeneity  through  a  combination  of  anion  exchange  (Q-sepharose), 
separation  based  on  hydrophobicity  (Phenyl  sepharose  or  ammonium  sulfate  precipitation)  and  gel 
filtration  (Superdex-200).  In  contrast,  HPVla  E7  (39-93)  protein  is  insoluble  when  expressed  at  37° 
but  is  refolded  and  purified  to  homogeneity  with  gel  filtration  (s200).  All  E7  constructs  elute  from 
the  gel  filtration  columns  in  a  single  peak  in  the  form  of  a  multimer.  Analytical  ultracentrifugation 
sedimentation  equilibrium  experiments  of  E7  constructs  indicate  that  this  protein  exists  as  a  stable 
dimer  with  a  Ka  at  or  below  the  low  nanomolar  range. 

In  addition  to  the  E7  constructs,  comparable  Adenovirus  5  El  A  constructs  containing  CRl- 
CR3  (amino  acids  36-189),  CR2-CR3  (amino  acids  1 14-189)  and  CR3  (135-189)  were  generated 
from  Adenovirus  5  El  A  cDNA  that  was  obtained  from  Dr.  Ricciardi  (Figure  la.  Constructs  4-6). 
These  constructs  are  expressed  in  bacteria  at  37°C  using  the  same  system  as  described  for  E7. 
E1A(36-189)  is  purified  with  a  combination  of  anion  exchange  (Q  sepharose),  dye  affinity 
chromatography  (Reactive  Red  and  Blue  sepharose)  and  gel  filtration  on  a  Superdex-200  gel 
filtration  column.  However,  gel  filtration  indicates  that  the  El  A(  36-189)  protein  exists  as  several 
differently  sized  multimers  that  appear  to  be  susceptible  to  degradation.  El  A(1 14-189)  and  ElA 
(135-189)  are  purified  to  homogeneity  by  anion  exchange  (Q  sepharose),  ammonium  sulfate 
precipitation  and  gel  filtration.  In  contrast  to  the  E7  dimer,  the  two  shorter  ElA  constructs  elute 
from  the  gel  filtration  colunrn  in  one  peak  that  is  consistent  with  the  molecular  weight  of  a  monomer. 

The  retinoblastoma  tumor  suppressor  protein  (pRB)  contains  two  domains  that  are  required 
for  minimal  viral  oncoprotein  interaction  (domain  A  and  domain  B,  Figure  lb).  These  two  domains 
are  referred  to  as  the  small  pocket  of  pRB.  Several  pRB  constructs  containing  these  domains  were 
subcloned  into  pRSET  A  for  bacterial  expression  (Figure  lb.  Constructs  1-5).  The  cDNA  for  full 
length  pRB  was  obtained  from  Dr  Ricciardi.  All  pRB  constructs  are  bacterially  expressed  in 
BL21(DE3)  cells.  These  constructs  produce  soluble  proteins  when  induced  at  15°C  overnight.  The 
soluble  6xhistidine  tagged  pRB  proteins  are  purified  with  a  combination  of  affinity  chromatography 
(Ni-NTA  agarose)  and  gel  filtration  on  a  Superdex-200  FPLC  column.  Each  of  these  proteins  elute 
from  the  gel  filtration  column  in  one  peak  consistent  with  the  molecular  weight  of  monomeric 
protein. 

A  bacterial  coexpression  system  has  been  developed  to  conveniently  prepare  suitable 
pRB/viral  oncoprotein  complexes  (manuscript  in  preparation).  Several  pRB  constructs  have  been 
subcloned  into  a  slightly  modified  version  of  the  pMR102  expression  vector  (Figure  lb.  Constructs 
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Figure  1.  Schematic  representation  of  pRB  and  viral  oncoprotein  constructs.  (A)  The  viral  oncoproteins 
contain  three  conserved  regions  (CR1,CR2  and  CR3).  The  constructs  of  HPV16  E7  and  Adenovirus  5  ElA 
are  shown  schematically.  Similar  constructs  were  made  for  HPVla  E7  as  described  in  the  text.  CR2  contains 
the  minimal  pRB  binding  region  LXCXE  (were  X  represents  any  amino  acid).  CR3  contains  a  Zn  binding 
region  comprised  of  two  CXXC  motifs  separated  by  a  linker.  The  CR3  region  of  E7  is  necessary  for  pRB 
inactivation.  (B)  pRB  is  a  928  amino  acid  protein  that  contains  two  domains  (A  and  B)  which  are  necessaiy 
for  viral  oncoprotein  interaction.  Five  6Xhistidine  tagged  pRB  constructs  are  shown.  All  constructs  contain 
domain  A  and  domain  B.  Construct  1  extends  to  the  C-terminus  of  the  protein.  Construct  2  and  construct  5 
extend  to  the  2““*  E7(CR3  region)  binding  site  (amino  acids  792-843)*.  Construct  3  and  construct  4  extend  to 
the  end  of  domain  B.  Construct  4  and  construct  5  have  deletions  from  579  to  621  in  the  flexible  linker  region 
ofpRB^ 

2-5).  The  kanamycin  resistant  pMR102  expression  vector  is  used  for  pRB  coexpression  with  viral 
oncoproteins  from  the  ampicillin  resistant  pRSET  vector.  pRB  and  viral  oncoproteins  are 
coexpressed  in  bacteria  at  15°C  and  purified  with  affinity  chromatography  (Ni-NTA  agarose)  and  gel 
filtration  (Superdex-200).  Several  of  the  following  pRB-viral  oncoprotein  complexes  have  been 
purified:  (pRB(376-792)/ElA(36-189),  pRB(376-792)/ElA(l  14-189),  pRB(376-792)/HPV16  E7(l- 
98)  and  pRB(376-792)/HPV16  E7(17-98)).  Gel  filtration  reveals  that  the  pRB/E7  complex  elutes 
from  the  column  as  a  molecular  weight  species  that  is  greater  than  a  1:1  stoichiometric  complex.  In 
contrast,  gel  filtration  and  analytical  ultracentrifugation  sedimentation  equilibrium  experiments 
demonstrate  that  the  pRB/El  A  complex  has  a  stoichiometry  of  1 :1 . 

All  of  the  above  complexes  have  been  utilized  for  crystallization  trials  with  several  different 
factorial  screens.  To  date,  these  complexes  have  resisted  crystallization.  The  recently  determined 
three  dimensional  crystal  structure  of  the  pRB  small  pocket  bound  to  a  9  amino  acid  E7  peptide'*  has 
provided  some  insight  into  why  these  coexpressed  complexes  have  not  crystallized.  The  previously 
crystallized  pRB-E7  peptide  complex  was  only  able  to  crystallize  when  the  linker  region  between 
domain  A  and  domain  B  of  pRB  was  removed  by  proteolysis.  The  pRB(376-792)  region  that  was 
used  for  coexpression  and  crystallization  trials  in  this  study  included  this  flexible  linker  region. 
Therefore,  a  construct  that  contains  the  small  pocket  of  pRB  with  a  truncated  linker  (Figure  lb. 
Construct  5)  has  been  made  for  crystallization  trials.  This  new  pRB  construct  (376-843AL)  also 
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contains  a  region  just  C-terminal  to  the  small  pocket  which  is  required  for  E7  CR3  binding  (Figure 
lb.  Construct  5).  This  pRB  construct  has  been  expressed  alone  and  purified  to  homogeneity. 
Currently,  a  purification  protocol  is  being  developed  to  obtain  suitable  pRB(376-843  AL)/viral 
oncoprotein  complexes  for  crystallographic  studies. 

Although  pRB/viral  oncoprotein  complexes  have  resisted  crystallization,  crystals  of  the 
HPVla  E7(39-93)  protein  have  been  obtained  and  tested  for  diffi-action  (Figure  2).  These  E7 

crystals  diffracted  to  approximately  2. 8 A  at  the  Brookhaven  synchrotron  beamline  X4A.  However, 
the  diffraction  spots  from  the  crystal  could  not  be  processed  and  indicated  that  the  crystal  was  highly 
mosaic.  These  HPVla  E7  crystals  are  currently  being  refined  in  order  to  obtain  higher  quality 
diffracting  crystals.  In  addition  to  E7  crystallization,  an  excellent  NMR  spectrum  of  Adenovirus  5 
El  A  (135-198)  CR3  region  has  been  obtained  recently.  The  NOE  and  chemical  shift  data  in  the 

Figure  2.  Crystals  of  HPVla  E7(39-93) 
protein.  Purified  HPVla  E7{39-93)  protein 
was  crystallized  by  a  hanging  drop  vapor 
diffusion  method.  The  reservoir  crystallization 
condition  contains  0.5-1. 5  M  NaCl,  20-30  % 

Ethanol  and  O.IM  Hepes,  pH  7.5.  Crystals 
tend  to  grow  as  plates  with  the  average 
dimensions  of  200|uim  X  200|uim  X  20pm. 


% 


2D-NOESY  spectrum  of  El  A  are  consistent  with  the  presence  of  a  mixed  P-strand,  a-helix  structure 
(Figure  3).  Therefore,  the  NMR  structure  of  the  El  A  CR3  region  is  being  pursued  as  well 

Figure  3.  Preliminary  2D-NOESY  spectrum  for  the 
Adenovirus  5  ElA  CR3  region.  Adenovirus  5 
E1A(135-189)  was  used  in  this  study.  The  NOE  and 
chemical  shift  data  for  this  domain  are  consistent 
with  the  presence  of  a  mixed  p-sheet,  a-helix 
structure.  The  line  widths  and  relaxation  times  of 
protons  are  consistent  with  a  monomeric 
Adenovirus  5  ElA  species. 


9.0  8.4  7.8  7.2  6.6 

Proton  (PPM) 


8 


PCAF  transcriptional  coactivator  studies 


A  DNA  construct  that  encoded  for  amino  acids  493  to  658  of  PCAF  (plus  N-terminal  met-lys) 
was  subcloned  into  the  pRSET-A  vector  for  bacterial  expression.  After  overexpression  of  PCAF  at  15°c 
for  12  hours,  the  majority  of  the  recombinant  protein  was  found  in  the  soluble  cell  extract.  Soluble 
p/CAF  protein  was  purified  by  a  combination  of  cation  exchange  chromatography  (SP-sepharose), 
affinity  chromatography  (Coenzyme  A-agarose)  and  size  exclusion  chromatography  (Superdex  200). 

The  monomeric  protein  was  then  concentrated  to  approximately  20-40  mg/ml,  flash  fi'ozen,  and  stored  at 


-70°C. 


For  protein  crystallization,  lOmg/ml  of  the  PCAF  protein  was  mixed  with  2-fold  molar  excess  of 
Na-acetyl  coenzyme  A.  Crystals  were  grown  by  the  hanging  drop  vapor  diffusion  method  at  room 
temperature.  The  crystallization  mixture  contained  1.3-1.6M  Li2S04  and  O.IM  Tris-Cl  (pH  8.5).  Rod¬ 
shaped  crystals  generally  appeared  after  2  to  3  weeks  with  average  cell  dimensions  of  0.2mm  X  0.08mm 
X  0.08nim.  Crystals  were  then  slowly  and  sequentially  transferred  into  a  cryoprotectant  solution 
containing  1.5M  Li2S04,  O.IM  Tris-Cl  (pH  8.5),  and  15%  Ethanol.  The  crystals  were  then  flash  frozen 
in  15%  ethanol  with  liquid  propane  for  data  collection.  A  native  data  set  was  collected  on  Beamline  X4- 
A  (?t=l  .0009A)  at  the  National  Synchrotron  Light  Source  at  Brookhaven  National  Laboratory.  The  data 
was  collected  in  1°  oscillations  using  a  Raxis-IV  area  detector.  The  programs  Denzo  and  Scalepack*^ 
were  used  to  process  and  scale  the  data. 

Two  solutions  for  the  PCAF-coenzyme  A  complex  were  obtained  at  10.0  to  4.0A  by  molecular 
replacement  using  the  coordinates  of  a  partially  refined  model  (R=30.1%,  Rfree='34.1%)  oiapo 
Tetrahymena  GCN5  with  program  AMORE’"*.  Prior  to  initial  rigid  body  refinement,  a  randomly  chosen 
10%  of  the  total  number  of  reflections  was  designated  as  a  test  data  set  and  all  residues  in  the  model  that 
were  not  identical  to  PCAF  residues  were  alanized.  The  initial  electron  density  maps  generated  with 
Fourier  coefficients  2|Fo|-|FcI  and  |Fo|-iFc|  showed  clear  side  chain  density  for  most  of  the  PCAF  specific 
residues.  The  alanized  residues  with  p/CAF-specific  side  chain  iFo|-|Fc|  electron  density  were  replaced 
by  p/CAF-specific  residues  using  the  program  O.  After  one  round  of  simulated  annealing  from  8.0  to 
3.0A,  |Fo|-|Fc|  electron  density  maps  showed  strong  peaks  for  the  pantothenic  acid  and  the 
pyrophosphates  of  the  3 ’phosphate  ADP  moiety  in  Coenzyme  A.  Refinement  proceeded  by  multiple 
rounds  of  positional  refinement,  simulated  annealing,  and  torsion-angle  dynamics  with  periodic  model 
building  in  O.  Refinement  was  carried  out  in  resolution  steps  of  3.0, 2.7, 2.5,  and  2.3A  using  the 
program  X-PLOR  3.8’^  and  CNS-SOLVE’^.  At  the  final  stages  of  refinement,  a  bulk  solvent  correction 
was  applied  using  data  from  20.0-2.3A  and  tightly  constrained  B-factor  refinement  was  performed  using 
the  program  CNS-SOLVE.  Ordered  water  molecules  were  built  into  strong  |Fo|-|Fc|  peaks  and  only 
retained  if  possible  H-bond  partners  could  be  located  and  if  they  refined  to  a  reasonable  B  factor. 

Protein  A  in  the  asymmetric  unit  resulted  in  a  model  containing  amino  acids  493-653  plus  an  N-terminal 
lysine.  Protein  B  in  the  asymmetric  unit  resulted  in  a  model  containing  493-652  plus  the  N-terminal 
lysine.  For  protein  B  only,  density  was  not  observed  for  solvent  exposed  residue  side  chains  503, 505, 
625,  626,  627,  63 1 ,  636  and  were  therefore  modeled  as  alanines.  The  final  model  of  each  protein-CoA 
complex  has  good  geometry  (Table  1)  with  none  of  the  non-glycine  residues  lying  in  disallowed  regions 
of  the  Ramachandran  plot. 

The  2.3A  crystal  structure  of  the  PCAF  protein  acetyltransferase  domain  reveals  an  a/p 
globular  fold  that  contains  a  central  protein  core  which  sits  at  the  base  of  a  pronounced  cleft  that  is 
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Table  1,  Crystallographic  Data  and  Refinement  Statistics 


formed  by  the  N-  and  C-terminal  protein  segments  (Figure  4).  The  protein  core  at  the  base  of  this  cleft 
makes  extensive  contacts  with  the  pantetheine  arm  of  coenzyme  A,  marking  the  active  site  of  the 
enzyme.  Additionally,  correlation  of  this  structure  with  extensive  mutagenesis  data  for  PCAF 
and  for  the  homologous  yeast  GCN5  protein  implicates  this  cleft  and  the  N-  and  C-terminal  segments  to 
play  an  important  role  in  histone  or  p53  substrate  binding.  Inspection  of  this  mutationally  sensitive 
region  suggests  that  a  glutamate  residue  within  the  protein  core  plays  a  catalytic  role  for  protein 
acetylation.  From  this 
crystallographic  study,  a 
catalytic  mechanism  for  the 
acetylation  of  histones  and 
p53  is  proposed.  In  order 
to  gain  insight  into  p53 
substrate  specificity, 
crystallization  trials  of 
PCAF  with  coenzyme  A 
and  a  p53 -derived  peptide 
are  underway. 


Crystal  Parameters 

Data  Collection  Statistics 

Unit  Cell  Dimensions 

Resolution  Range 

20.0-2.3  A 

a=97.00  A ,  b=97.00  A,  c=77.85  A 

Total  Reflections 

94,731 

a=90.00*,  p=90.00“,  y-120.00® 

Unique  Reflections 

17,943 

R.yn,(%) 

4.0  (15.5) 

Space  Group 

P64 

//sigma  (/) 

18.0(4.8) 

Asymmetric  Unit 

2  molecules 

Completeness  (%) 

96.5  (99.8) 

Reflnement  Statistics 

Resolution  Range 

20.0-2.3  A 

R.m.s.  Values 

I/c  cutoff 

0.0 

Bond  length  (A) 

0.007 

Final  Model 

Bond  angles  O 

1.89 

Protein  atoms 

2606 

NCS  molecules  (A) 

1.38 

Water  atoms 

109 

B-factors  (A^) 

1.64 

CoA  atoms 

96 

Average  B-factors  (A^) 

R*factors 

Protein  (A/B)* 

31.5/40.7 

Rworldng 

22.3% 

Water 

39.0 

Rfrce 

26.8% 

CoA  (Amy 

39.4/52.2 

R-factor:R..rtu„j=l|  |/z|F,|;  R,™,=  St  I  IfJ-|f.|  1/St  lF„|,whereT 

is  a  test  data  set  of  10%  of  the  total  reflections  randomly  chosen  and  set  aside  before 
reflnement. 

*A  and  B  refer  to  complexes  A  and  B  in  the  asymmetric  unit  cell  The  numbers  in 
parentheses  are  for  the  highest  resolution  bins. _ _ 


Figure  4.  Structure  of  the  PCAF-coenzyme  A 
complex.  The  four  domains  of  the  protein  are 
color-coded.  The  core  of  the  protein,  motifs  A- 
D  (as  defined  for  the  diverse  family  of  N- 
acetyltransferases)  and  motif  B’(based  on 
structural  conservation),  are  colored  blue  and 
green,  respectively.  The  N-  and  C-terminal 
protein  segments  flanking  the  core  are  colored 
magenta  and  gold,  respectively.  Coenzyme  A  is 
colored  red. 


In  summary,  this  study  has  demonstrated  that  HPV16  E7  and  HPVla  E7  proteins  exist  as 
stable  dimers,  while  comparable  constructs  of  Adenovirus  5  El  A  exists  as  monomers.  Bacterial 
coexpression  can  be  utilized  to  form  stable  pRB/viral  oncoprotein  complexes.  pRB/El  A  complexes 
have  a  stoichiometry  of  1 : 1 ,  whereas  the  pRB/E7  complex  has  a  greater  stoichiometry.  Although 
pRB/viral  oncoprotein  complexes  have  resisted  crystallization  to  date,  a  new  “linkerless”  pRB 
construct  has  been  generated  for  crystallization  trials  with  viral  oncoproteins.  The  HPVla  E7(39-93) 
crystals  are  being  refined  to  obtain  high  quality  diffracting  crystals.  Structural  studies  using  NMR 
are  currently  being  performed  with  the  Adenovirus  5  E1A(135-189)  CR3  region.  Additionally,  the 
crystal  structure  of  the  human  PCAF  acetyltransferase  domain  has  been  solved  to  2.3A  and  provides 
tremendous  insight  into  the  mechanism  of  histone  acetylation  and  p53  activation. 
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APPENDIX 


Key  Research  Accomplishments 

•  A  bacterial  coexpression  system  has  been  developed  for  convenient  pRB/viral 
oncoprotein  complex  formation. 

•  pRB/viral  oncoprotein  complexes  have  been  purified  from  bacteria. 

•  This  study  reveals  that  the  HPV  E7  exists  as  a  stable  dimer,  while  corresponding 
Adenovirus  5  El  A  constructs  exist  as  monomeric  species. 

•  This  study  reveals  that  the  stoichiometry  of  the  pRB/El  A  complexes  is  1 : 1 ,  while  the 
pRB/E7  complex  exists  as  a  larger  oligomer. 

•  The  HPV  1  a  E7  CR3  region  has  been  crystallized. 

•  The  Adenovirus  5  El  A  CR3  region  gives  an  excellent  NMR  spectrum. 

•  The  structure  of  the  human  PCAF  protein  acetyltransferase  domain  bound  to 
coenzyme  A  has  been  solved  by  X-ray  crystallography. 
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The  human  factor,  PCAF, 

mediates  transcriptional  activation  through  its  ability 
to  acctylote  nocleosomal  histone  substrates  as  well 
as  transcriptional  actlvaUN^  such  as  p53-  We  have 
determined  the  2^  A  crystal  structure  of  the  histone 
acetyltransferase  (HAT)  domain  of  FCAF  bomul  to 
coenzyme  A.  The  structure  reveals  a  central  protein 
core  associated  with  coenzyme  A  binding  and  a  pro¬ 
nounced  deft  that  sits  over  the  protein  core  and  is 
flanked  on  opposite  sides  by  the  N-  and  C-tcnniiial 
protein  segments.  A  correlation  of  the  structure  with 
the  exten^ve  mutagenesis  data  for  FCAF  and  the 
homologous  yea^  GCN5  protdn  implicates  the  deft 
and  the  N-  and  C-temiinal  protein  segments  as  playing 
an  important  role  in  histone  substrate  binding,  and  a 
glutamate  residue  in  the  protein  core  as  playing  an 
essential  catalytic  role.  A  structural  comparison  with 
the  coenzyme-bound  forms  of  the  related  AT-acetyl- 
transferases,  HATl  (yeast  histone  acetyltransferase  1) 
and  SmAAT  {SerraHa  marc^sum  aminoglycoside 
3-^-aoetyltransfcrase),  suggests  the  mode  of  substrate 
binding  and  catalysis  by  these  enzymes  and  establishes 
a  paradigm  for  understanding  the  structuro-fiinction 
relationships  of  otiicr  enzymes  that  acetylate  histones 
and  transcriptional  regulators  to  promote  activated 
transcription. 

Keywords:  aceiyltransferasc/coacxivatoT  HAT/p300/CBP- 
associating  factor 


Introduction 

The  PCAF  (for  p300/CBP-associating  factor)  transcrip¬ 
tional  coactivator  was  identified  initially  through  its  ability 
to  interact  with  p300/CBP  for  the  transcriptional  activation 
of  many  genes,  and  to  counteract  the  ability  of  the 
adenoviral  £1 A  oncoprotein  to  inhibit  pSOO/'CBP-mediated 
transcriptional  activation  (Yang  ct  ai^  1996).  The  same 
study  showed  that  PCAF  contains  intrinsic  histone  acetyl- 
transferase  activity,  a  property  previously  demonstrated 
for  the  GCN5  transcriptional  coactivaior  (Marcus  et  al, 
1994;  Brownell  tt  aL  1996),  and  is  conelared  with 
Ixanscriptional  activation  (Brownell  and  Aliis,  1996; 


Wolffe  and  Pniss,  1996;  Grunsiein,  199^).  More  recently, 
PCAF  has  also  been  shown  to  interact  with  the  DNA- 
binding  domain  of  nuclear  receptors  such  as  RXR/RAR, 
independently  of  p300/CBP  binding,  to  promote  retinoid- 
responsive  transcriptional  activation  (Blanco  et  aL^  1998), 
and  has  been  shown  to  interact  directly  with  El  A  resulting 
in  an  inhibition  of  its  intrinsic  histone  acetyltransferase 
activity  and  its  ability  to  mediate  transcriptional  activation 
(Reid  et  aL  1998;  Chakravarii  et  ah,  1999). 

Analysis  of  the  primary  sequence  of  the  832  residue 
PCAF  protein  reveals  that  it  contains  a  C -terminal  brorao- 
domain  (within  residues  725-819),  a  central  histone  acetyl¬ 
transferase  (HAT)  domain  (within  residues  493-^53) 
highly  homologous  to  the  GCN5  transciptional  coactivaior 
Ifrom  Tetrahymena  (Brownell  et  <d.,  1996)  and  from 
yeast  (Marcus  et  al.,  1994)1  and  a  structurally  divergent 
N-terminal  region  (Yang  et  aL^  1996).  More  recently,  Roth 
and  colleagues  have  shown  that  the  N-tenninal  region  of 
PCAF  shares  homology  with  the  predominant  form  of 
mammalian  GCN5  (Xu  etal,,  1998).  Functional  character¬ 
ization  of  the  N-tenniiial  segment  ot  PCAF  shows  that  it 
contains  an  interaction  surface  for  p300/CBP  (Yang  et  a/., 
1996;  Xu  €t  aL  1998),  other  transcriptional  activators 
(Currie,  1998;  Krumm  ctaL,  1998)  and  El  A  (Chakravaiti 
€{  oi.,  1999),  and  is  required  for  nucleosomal  acetylation 
mediated  by  the  PCAF  HAT  domain  (Yang  et  a/.,  1996), 

The  HAT  domain  of  PCAF  has  been  analyzed  extens¬ 
ively  at  the  aiuino  acid  and  functional  levels-  The  HAT 
domain  of  PCAF  shares  a  high  degree  of  sequence 
homology  with  GCN5  from  various  species  (OCN5/PCAF 
subfamily  of  histones  acecyltransferases)  (Marcus  cr  aL 
1994;  Brownell  et  al,^  1996;  Candau  et  aL  1996;  Smith 
et  aJ.,  1998a)  and  has  functional  horaolog>'  with  other 
transcriptional  coactivators  that  harbor  HAT  activity’ 
including  veast  ESAl  (Smith  et  al.,  1998b),  and  human 
CBP/300  (bgryzico  etaL  1996).  T.AJFd250  (Mizzen  et  aL 
1996),  Tjp60  (Yamamoto  and  Horikoshi,  1997),  ACTR 
{Chen  et  aL  1997)  and  SRC-1  (Spencer  et  aL  1997). 
More  recently,  detailed  sequence  analysis  has  revealed 
that  the  HAT  domain  of  PCAF  shares  limited  sequence 
homology  with  a  biologically  diverse  family  of  GCN5- 
related  ^-acetyltransfeTascs  (GNATs)  within  three  rela¬ 
tively  small  motifs  (15-33  residues)  called  A,  D  and  B 
(Ncuwald  and  Landsman,  1997). 

In  vivo,  PCAF  has  been  shown  to  function  in  tlte  context 
of  a  large  multisubonit  protein  complex  with  >20  distinct 
polypeptides  including  several  of  the  TATA-blnding 
protein  (TBP)-as50ciaicd  factors  (TAFs)  and  human 
counterparts  to  the  yeast  ADA2,  AD  A3  and  SPT3  proteins 
(Ogryzko  et  aL  1998),  The  histone  substrate  specificity 
of  PCAF  has  been  characterized,  showing  a  strong  prefer¬ 
ence  for  histone  H3  and  to  a  lesser  extent  histone  H4 
(Yang  et  aL  1996;  Xu  et  aL  1998),  Interestingly,  unlike 
yeast  GCN5  (Kuo  ei  aL  1996.  1998;  Wang  ef  aL  199S), 
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the  histone  preference  of  PCAF  appears  to  be  similar  for 
both  free  and  nudccsomal  histones.  Surprisingly,  PCAF 
has  also  been  reported  to  acctylate  noa-histone  substrates 
Including  the  basal  transcription  factors  TFIDF  and  the 
P-subunit  of  THIE  (ImJiof  et  al,  1997).  Recendy,  we 
have  reported  that  PCAF  spedflcaJly  acetylates  Lys320 
of  the  p53  transcriptional  activator  In  vitro,  resulting  in 
an  increased  affinity  of  p53  for  DNA  (Liu  et  oL,  1999). 
Conelatively,  we  find  that  these  same  sites  arc  acetylated 
in  vivo  in  response  to  DNA  damage.  Most  recently,  the 
histone  acctyltransferase  activity  of  PCAF  towards  both 
nucleosomal  histones  and  p53  has  been  shown  to  be 
inhibited  by  the  direct  binding  of  El  A  to  its  HAT  domain 
(Chakiavarti  et  al.^  1999).  In  order  to  obtain  a  detailed 
view  of  the  mechanism  of  protein  acetylation  by  PCAF, 
we  have  determined  the  crystal  structure  of  its  HAT 
domain  in  complex  with  coenzyme  A  to  a  resolution 
of  23  A. 

Results  and  discussion 

Overall  structure  of  the  PCAf^oenzyme  A 
complex  (p5)S 

The  HAT  domain  of  human  PCAF  (residues  49M55a) 
was  overexpressed  in  Escherichia  coii  and  purified  to 
homogeneity  using  a  combination  of  cacion-excliange, 
coenzyme  A  affinity  and  gel  filtration  chromatography. 
Crystals  were  obtained  containing  two  protomers  per 
asynunetric  unit  and  the  structure  was  detennined  by 
molecular  replacement  using  tite  unrefined  scrucnire  of  the 
nascent  HAT  domain  of  Tetrahymena  GCN5  as  a  search 
model  (J.R.Rojas,  R.C.Tricvel,  YMo,  X.Li,  J,Zhou. 
S-L.Berger,  C.D>Jlis  and  R.Marmor5tein,  submitted) 
(Tabic  I).  The  two  PCAF  protomers  in  the  asymmetric 
unit  make  modest  interprotein  interactions  and  have  nearly 
identical  struciT^,  with  an  r.m.8.  deviation  between  all 
atoms  of  IJ8  A. 

PCAF  has  a  topology  and  contains  a 

globular  fold  except  for  a  pronounced  cleft  along  one  side 
of  the  protein  (Figure  1  A).  It  is  convenient  to  ihinlc  of  the 
core  as  being  formed  by  two  tertiary  structural  elements 
near  the  center  of  the  protein.  The  first  element  con^s 
p-strands  2.  3  snd  4  aligned  in  an  antiparallcl  orientation 
on  top  of  helix  a3,  while  the  second  element  is  formed 
by  an  adjacent  p5-strand--loop-a4-helix.  The  coenzyme  A 
cofactOT  is  bound  between  the  two  elements  of  the  core 
along  one  edge  of  tl)c  protein  with  its  labile  sulfhydryj 
pointing  into  die  protein  cleft  which  is  flanked  on  opposite 
sides  by  tJje  N-  and  C-tcnuinal  domains  of  the  protein. 
Within  the  N-terminal  domain,  a  p-strand  forms  sheet 
interactions  with  the  p2-strand  of  tiie  core,  and  a  helix- 
turn-helix  (al-tuni-a2)  sits  on  one  side  of  the  protein 
above  the  core.  The  C-terminal  domain  contains  a  helix- 
loop-strand  (a5-loop“P6)  which  lies  opposite  the 
N-terrainal  domain  above  the  protein  com  and  interacts 
with  the  core  domain  through  parallel  sheet  interactions 
between  p5  and  p6. 

Mode  of  coenzyme  A  binding  by  PCAF 

The  coenzyme  A  cofactor  is  bound  in  a  cavity  formed  on 
the  surface  of  the  core  region  of  PCAF  and  buries  over 
one-half  of  tiie  coenzyme  A  accessible  surface  area  and 
•-520  of  protein  surface  area  (Figures  lA  and  2).  It  is 
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flanked  by  the  p4-loop-oc3  segment  that  corresponds  to 
motif  A  of  the  GNAT  proteins  on  one  side  and  Che  P5- 
loop-a4  segment  coiresponding  to  motif  B  of  the  GNAT 
proteins  on  the  other  side  (Figures  IB  and  2).  Coenzyme  A 
is  bound  in  a  bent  conformation  (Figure  2C)  which  helps 
facilitate  an  extensive  set  of  protein  interactions  that  are 
mediated  predominantly  by  the  pantetheine  arm  and  the 
pyrophosphate  group  of  coenzyme  A  (Figure  2A).  Strik¬ 
ingly,  all  but  two  groups  of  the  16  member  pantheteine 
aim-pyrophosphate  chain  are  contacted  by  die  protein. 
AD  but  one  of  these  contacts  are  mediated  through  either 
protein  backbone  hydrogen  bonds  or  proieiii  side  chain 
van  dcr  Waals  contacts. 

PCAF  residues  in  the  GNAT  conser/cd  motifs  A  and 
B  interact  extensively  with  coenzyme  A.  Specifically, 
residues  580  and  582-5S7  in  die  04-  loop-a3  region  of 
motif  A  make  an  extensive  set  of  both  dirccr  and  water- 
mediated  hydrogen  bonds  with  tlie  pyrophosphate  group 
(Figure  2).  Thr587  also  makes  die  only  side  chain  hydrogen 
bond  to  the  coenzyme,  through  a  pyrophosphate  oxygen. 
The  aliphatic  side  chain  of  GJn58l  and  a  Cys-AJa-Val 
sequence  (residues  574-576)  at  the  tip  of  the  04-strand 
m^es  an  extensive  set  of  van  der  Waals  contacts  through¬ 
out  most  of  the  length  of  the  pantetheine  aim.  In  addition, 
the  backbone  residues  of  Cy5574  and  Val576  form  hydro¬ 
gen  bonds  with  the  pancetheine  arm.  Residues  in  die  05*- 
]oop-a4  region  of  GNAT  motif  B  make  predominantly 
van  dcr  Waals  contacts  with  the  0-mcrcapioethylaiuine 
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Fig.  2.  The  coenryme  A-bindng  sire.  (A)  Schcnifitic  drawing  Of  PCAF  iweractioDi  with  coenzyme  A.  Hydregrn  bonds  arc  hidicaicd  with  b?ack 
arrows,  and  van  der  Waals  inleracUons  are  indicated  with  white  arrows.  (B)  CoenTynio  A-proccin  interactions.  Protein  residues  that  mak;  van  der 
Waals  ccTitac*j5  and  hydiogcn  bondi  (dofied  UncJ  ire  indicated.  (C)  oA-welghied  F, omit  map  around  J>c  pantetheine  arm  of  die  coenr/mc  A 
cofactor.  The  map  was  generated  by  omitdng  Tcsidaea  within  a  4.S  A  radius  of  the  ccfactor  followed  by  simulated  annealing  dynarnics  refinement  at 
a  temperature  of  1000  K..  The  map  is  contoured  at  o.  A  portion  of  coenzyme  A  is  indicated  In  red  and  the  surrounding  protein  is  indicated  in 
green.  The  gold  spheres  indicate  ordered  water  molecules. 
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Crystal  structure  of  the  PCAf-coenzyme  A  complex 


FJg,  X  FunctioiuU  impUcations  of  the  PCAF-coenjtytne  A  complex.  (A)  Highly  conserved  lesidues  wicbin  the  OCN5/PCAF  subfamily  of 
ace(>ltrans&Tascs  aie  mapped  onto  the  1PCA5  proteunu  Residues  that  are  associated  with  coeaayme  A  interaction  are  shown  in  led,  residues  tliat  ate 
implicated  m  subst:^  binding  and/or  catalysis  ore  ihowzi  in  gcoem  The  renuliiung  strictly  conserved  residues  chat  Lie  largely  buried  and  presumably 
impqrtart  for  protein  stability  am  qmitJed  fqr  clarity.  Residue  numbers  at  the  borders  of  secondary  structural  elcmcatt  arc  indicated  for  reference- 
(B)  Mutation?  itl  human  PCAF  (Martincz-Balbas  et  aL,  1998)  and  yeast  OCnS  CKuo  et  ai,  1998;  Wang  ct  af,  2998)  that  decrease  HaT  activity  are 
mapped  onto  a  schematic  tepresentation  of  the  PCAF  HAT  domain.  The  color  coding  is  Cw  same  as  in  (A),  and  residues  involved  in  protein  stabiUty 
ire  shown  in  gold. 


segment  of  the  pantetheine  arm  and  thus  play  a  major 
role  in  orienting  the  reactive  sulfhy^l  atom  (atom  1. 
Hgure  2)  for  acetyl  transfer.  The  protein  residues  involved 
are  A3a6i3,  Tyf616  and  Phe6l7,  while  Tyr616  also  makes 
van  der  Waals  contacts  with  the  end  of  the  pantetheine 
arm  near  the  pyrophosphate  group. 

l>vo  residues  in  the  non-conserved  (within  the  GNAT 
family)  N-icrrainal  segment  of  PCAF  also  interact  with 
coenz)"me  A.  These  residues,  Gln525  and  Leu526,  which 
sit  above  the  core  and  on  one  side  of  the  putative  substrate- 
binding  cleft,  make  van  der  Waals  contacts  with  the 
pantetheine  arm  of  coenzyme  A  (Figure  2).  Tlie  proximity 
of  these  rcsiduc.s  to  the  cofactor-substrate  junction  sug¬ 
gests  that  they  play  an  important  role  in  substrate- specific 
binding  and/or  catalysis.  In  contrast  to  the  pantedieine 
arm  and  pyrophosphate  group  of  coenzyme  A,  which 
make  extensive  protein  interactions  that  are  conser\cd 
between  both  PCAF  protomers  in  the  asynmietric  unit 
ceil,  the  adenosine  base  of  the  3 '-phosphate  adenosine 
group  interacts  less  extensively  in  the  PCAF-coenzyme  A 
/\ 

M/i+h  prt'jcm 


complex.  In  general,  residues  in  die  a4-helix  make  van 
der  Waals  contacts  with  the  adenosine  base:  however,  the 
contacted  atoms  differ  between  the  two  PCAF  protomers 
of  die  asymmetric  unit  cell,  and  die  3'-phosphate  ADP 
group  is  structurally  variable  between  die  two  protomers. 
The  functional  importance  of  die  PCAF~cocnzyme  A 
interaedens  correlates  almost  perfectly  with  the  amino 
acid  conservation  within  the  GCN5/PCAF  subfamily  of 
acciyl transferases  and  mutational  analysis  (Figures  IB  and 
3).  Strikingly,  13  of  the  17  protein  residues  that  contact 
coenzyms  A  are  strictly  consen'ed  within  the  GCN5/ 
PCAF  subfamily  of  acetyltransferases  (this  includes 
Gly6l5  which  makes  variable  van  der  Waals  contacts  with 
the  adenosine  base),  and  of  the  remaining  four  residues 
<mly  conservative  changes  are  observed  (Figures  IB  and 
3 A).  In  addition,  12  of  the  protein  residues  that  contact 
coenzyme  A  are  sensitive  co  mutation  in  the  form  of  either 
single  (Kuo  et  aL,  1998)  or  triple  (Wang  ct  al,  1998) 
alanine  substitutions.  In  particular,  the  yeast  GCN5  muta¬ 
tion  KQL  (corresponding  to  residues  52^526  in  the  al- 
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loop  region  of  PCAF),  and  IGY  and  FKK  (corresponding 
to  residues  614-^16  and  617-619  in  dbe  a4  helix  of 
PCAF)  were  among  the  most  debilitating  mutations  for 
both  growth  and  transcription  in  vivo  and  histone  acetyl* 
transferase  activity  in  vitro  (Wang  et  aly  1998).  Moreover, 
single  mutations  of  nearly  all  the  residues  in  the  pwoop- 
Ct3  region  that  mskt  coenzyme  A  contacts  in  our  structure 
have  dramatic  effects  on  HAT  activity  in  vitro  {Kuo 
et  aly  1998). 

Hhtone/traoscrijpftiQn  factor  substrata  bfndfng  by 
PCAF 

A  striking  feature  of  the  PCAF-coenzyroc  A  complex  is 
die  pronounced  cleft  that  is  situated  above  the  protein 
core  and  fianked  on  opposite  sides  by  the  N-  and  C-icrrainal 
protein  segments.  There  are  several  soructurai  character¬ 
istics  of  this  cleft  which  implicate  it  as  site  for  binding 
by  histone  and  transcription  foctor  substrates.  First,  at  the 
base  of  the  cleft  is  an  acidic  patch  formed  by  the  side 
chains  of  Glu570  and  Asp610,  as  well  as  the  backbone 
carbonyls  of  lle571,  Val572  and  Tyi608,  creating  an 
attractive  site  for  the  basic  lysine  substrate  (Figure  4B), 
Secondly,  the  cleft  has  approximate  dimensions  of 
lOX  10X20  A  which  could  easily  accommodate  a  protein 
strand  harboring  the  reactive  lysine  side  chain  (Figure  1  A). 
Thirdly,  relatively  flexible  loops  (with  relatively  high 
atornic  5-factors)  sit  direcUy  above  the  cleft  between  the 
al-€c2  and  regions  and  ore  in  position  to  undergo 
any  minor  structural  reanrangements  that  may  be  necessary 
to  accommodate  substrate  binding  (Figure  4A)*  Most 
importantly,  the  cleft  sits  directly  above  the  coenzyme  A 
cofactor  in  the  appropriate  geometrical  juxtaposition  for 
catalysis. 

The  high  degree  of  amino  acid  conservation  within 
the  GCN5/PCAF  subfamily  of  acctylcransferascs  and  the 
mutational  sensitivity  of  regions  proximal  to  the  cleft 
is  consistent  with  its  importance  in  substrate  binding 
(Figures  IB  and  3).  A  mapping  of  highly  conserved 
residues  within  the  GCN5/PCAF  subfamily  onto  the  PCAF 
structure  shows  that  a  large  number  of  them  map  to  buried 
residues  important  for  protein  stability  or  to  residues  that 
interact  with  coenzyme  A.  Significantly,  the  majority  of 
the  remaining  residues  map  to  regions  within  or  flanking 
tlie  pronounced  protein  cleft  that  sits  above  the  core 
(Figure  3A).  In  particular,  regions  proximal  to  the  two 
loop  regions  flanlung  the  cleft  contain  large  patches  of 
conserved  residues.  Specifically,  residues  525-534  (QLP- 
XMPKEYI)  in  the  loop“a2  region  and  residues  635-646 
(GYIKDYXGATLM)  in  the  loop-P6  region  arc  highly 
conserved  and  are  in  position  to  Interact  with  a  substrate 
tlia:  may  bind  in  the  protein  cleft  Correlating  well 
with  the  importance  of  these  residues  is  their  mutation 
sensitivity  in  the  yeast  GCN5  homolog  for  growth  and 
transcription  in  vivo  and  HAT  activity  in  vitro  (Kuo  et  aL 
I99S;  Wang  et  aL,  1998)  (Figure  3B).  Specifically,  the 
triple  alsuiinc  yeast  GCN5  mutation  corresponding  to  PRM 
in  residues  527-529  of  FCAF  was  among  the  most 
debilitating  triple  mutation  identified  (Wang  et  al.y  1998). 
The  C-terminal  loop-p6  region  was  found  to  be  even 
more  mutationaily  sensitive.  The  yeast  GCN5  KDY  triple 
mutation  corresponding  to  residues  638-640  of  PCAF 
(Wang  et  gL,  1998)  and  the  single  mutations  corresponding 


to  ne637,  Tyr640,  Thcti44  and  Leu645  were  all  found  to  be 
among  the  most  debilitating  mutations  (Kuo  etaL,  1998). 

Interestingly,  residues  proximal  to  the  cocnz>’me  A- 
binding  site,  but  not  directly  involved  in  coenzyme  A 
binding,  are  also  highly  conserved  and  sensitive  to  muta¬ 
tion.  These  residues  cluster  to  the  loop  immediately 
following  the  35  strand  (Figure  IB).  Specifically,  Ala6()9 
and  Asp6l0  arc  strictly  conserved  within  the  GCN5/PCAF 
subfamily  of  HAT  proteins,  and  the  triple  DNY  mutation 
in  yeast  GCN5,  corresponding  to  residues  610-612  of 
PCAF,  are  defective  in  both  growth  and  transcription 
in  vivo  and  HAT  activity  in  vitro  (Wang  et  al,  1998) 
(Figure  3).  These  results  suggest  that  this  region  of  PCAF, 
at  the  junction  between  the  cleft  and  the  coenzyme  A- 
binding  site,  also  plays  an  important  role  in  substrate 
binding  and/or  catalysis. 

Catalysh  by  PCAF 

Acetyl-coenzyme  A-dependent  transferases  catalyze  the 
transfer  of  an  acetyl  group  to  the  substrate  through  one 
of  two  mechanisms.  The  ping-pong  mechanism  involves  a 
covalent  protein  intermediate  in  which  acetyl  •coenzyme  A 
binds  to  the  enzyme  and  acetylates  an  active  site  nuclco- 
pliile  which  in  turn  transfers  the  acetyl  group  to  the 
substrate.  Tlw  second  mechanism  requires  formation  of  a 
ternary  protein-cofactor-substrate  complex  and  proceeds 
through  the  direct  nucleophilic  attack  of  substrate  on 
acetyl-cocnzymc  A.  This  ternary  complex  mechanism 
usu^y  requires  the  presence  of  a  protein  side  chain  to 
serve  as  a  general  base  for  substrate  proton  extraction  to 
facilitate  acyl  addition.  Inspection  of  the  PCAF  structure 
reveals  diat  there  is  no  residue  in  the  proximity  of  the 
active  site  to  function  as  a  nucleophile  via  the  ping-pong 
mechanism.  Cy5648,  which  in  theory  could  act  as  a 
nucleophile,  is  strictly  conserved  in  the  GCN5.FCAF 
subfamily  of  acetyltransferases,  but  is  too  far  from  the 
active  site  to  play  a  catalytic  role.  The  inability  of 
Brownell  and  Allis  (1995)  to  prepare  a  covalent  pHJacetyl 
intermediate  of  Tetrahymena  GCN5  using  pHJacecyl- 
coenzyme  A  also  argues  against  a  ping-pong  mechanism 
for  PCAF. 

Inspection  of  the  substratc-binding  cleft  of  PCAF  reveals 
that  there  are  two  residues  that  arc  in  sufficient  proximity 
to  act  as  a  general  base  for  catalysis  via  a  tcmar>'  complex 
mechanism  (Figure  4A).  These  residues,  Glu570  in  the 
34-strand  and  Asp610  in  the  loop  between  the  pS-straiid 
and  the  a4-helix,  are  both  located  in  the  core  domain  of 
PCAF  and  are  strictly  conserved  within  the  GCN5/PCAF 
subfamily  of  histone  acecylnnnsfcrases.  Mutational  ana¬ 
lysis  strongly  favors  the  catalytic  mvolvemcnt  of  G!u570 
since  mutation  of  the  corresponding  residue  in  yeast  GCN5 
(Glul73)  to  alanine  or  glutamine  is  one  of  the  most 
debilitating  mutations  within  the  HAT  domain  of  yeast 
GCN5  in  Iwth  transcriptional  activation  in  vivo  and  histone 
acetylation  in  vitro  (Wang  et  at.,  1998;  R.Howard,  R.C. 
Trievel,  R.Marmorstein  and  S.L.Berger,  unpublished).  In 
contras:,  mutation  of  the  yeast  counterpart  of  Asp6l0  in 
PCAF  is  only  margmally  compromised  In  both  transcrip¬ 
tional  activation  in  vivo  and  histone  acetylation  in  vitro 
(Kuo  et  aL,  1998). 

Close  inspection  of  the  protein  environment  proximal 
to  Glu570  shows  that  it  is  in  an  ideal  environment  to  play 
a  catalytic  role  (Figure  4).  First,  Glu570  is  located  proxim^ 
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F^.  4.  Histone  acctyitnuist'erosc  active  site  of  PC.'VF.  (A)  Dcuiled  view  of  the  PCAP  active  site.  A  clo»«*ap  view  around  the  putative  general  base 
Oiu570  is  shown  in  red  with  the  P-mcrcsptocihylamine  tnoicJy  of  the  cocnzyrtie  A  shown  in  aqua.  Hydrophobic  a.nd  polar  side  chains  are  indicated 
in  blue,  the  ore  tcidio  side  chain  in  the  vicinity.  A$p6lO.  b  indicated  in  pink,  and  two  buic  side  chains  m  the  vicinity.  Aig56l  and  Ly5632,  ate 
indicated  in  green.  (H)  Blcctrcstaric  sorfacc  of  PCAF  looking  into  the  active  site.  Red  indicates  region?  of  negative  etecirostadc  potenihJ,  blue 
indicates  regions  of  positive  electrcsuiiic  potential  and  white  indicares  neutrally  charged  regions.  Coenzyme  A  is  indicated  as  a  stick  figure. 

(C)  Proposed  reaction  mechanism.  Protein  residue.^*  and  coenr/me  fuactionalitics  that  play  a  direct  role  in  the  catalytic  mechanism  arc  indicated.  The 
hydrophobic  residues  (F563,  F568, 1571.  V572,  L60d,  1627  and  Y640)  that  function  lo  pK^  of  the  catalytic  base  (E570)  and  the  backbone 

KH  of  C574  that  serves  to  .•icabnize  the  tetrahedral  reaction  mieraicdiatc  are  indicated. 

ro-iSe- 
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to  an  acidic  patch  described  above  which  forms  an 
attractive  surface  for  the  bask  lysine  substrate  (Figure  4B). 
Secondly,  the  carboxylate  of  Glu570  is  surrounded  by 
several  hydrophobic  residues  (Phc563»  Phc568,  nc571, 
Val572,  LeuSOd,  ne637  andTyr640)  that  probably  function 
to  raise  the  p/C|  of  the  glutamate  side  chain  and  thus 
facilitate  proton  extraction  from  the  lysine  substrate. 
Thirdly,  the  carboxylate  of  Q]u570  is  only  ~1L5  A  away 
from  the  putative  position  of  the  reactive  thiocster  (adjusted 
by  a  rotation  about  the  2-3  bond.  Figure  2A)  of  acetyl- 
cocnzymc  A  (Figure  4A).  Depending  on  where  the  lysine 
substrate  binds,  proton  extraction  may  proceed  directly 
through  the  carboxylate  of  G]u570  or,  alternatively, 
through  a  water  molecule.  Consistent  with  the  involvement 
of  a  water  molecule  in  catalysis  is  the  presence  of  a  water 
molecule  tightly  bound  to  the  carboxylate  oxygen  of 
Gluo70  which  is  closest  to  the  coenryme.  Significantly, 
this  water  is  present  in  both  PCAF  complexes  in  the 
asyiumetric  nmt.  The  final  requirement  for  catalysis  is 
the  presence  of  hydrogen  bond  donors  to  stabito  tbc 
tetrahedral  reactiem  intermediate  involving  the  substrate, 
PCAF  enzyme  and  acetyl-coenzyme  A  oofactor.  The  only 
potential  hydrogen  bond  donor  in  the  binary  complex  is 
the  backbone  NH  of  Cys574,  although  in  tbc  presence  of 
subsn^te  additional  donors  may  also  be  provided  by  one 
or  more  backbone  NHs  of  the  histone  or  transcription 
factor  substrate.  Based  on  the  discussion  above,  we  propose 
a  mechanism  for  catalysis  illustrated  in  Figure  4C. 

Implications  for  coro  dowmn  atructuro  and 
Goenzyme  A  binding  for  other  N-seetyltrensfersses 
Recently,  the  structures  of  the  cocnz>mc  A-bound  forms 
of  two  other  members  of  the  GNAT  superfamily  of 
N-acet>itrarwferascs  have  been  reported;  Sacthuromyces 
ccrcvislae  histone  acetyltransferase  1  (HATl)  (Dumall 
et  al,  1998)  and  the  SerraHa  marcescerts  aminoglycoside 
3-iV-aoecyltransferasc  (ShlAAT)  (Wolf  et  a/„  1998).  A 
structural  comparison  of  the  PCAF  HAT  domain  witii 
these  proteins  reveals  that  tbc  PCAF  core  domain,  formed 
by  motifs  A  and  D,  superimposes  well,  with  r.m.s,  devi¬ 
ations  between  atoms  for  PCAF  compared  with  HATl 
and  SmAAT  of  0.74  and  0.80  A,  resi»ctively  (Figure  5B). 
Interestingly,  the  recently  published  structure  of 
A^-rayristoyi  transferase  (NMT)  (Bhamagar  et  aL,  1998; 
Weston  ctaU  1998),  which  uses  a  myristoyl-Co A  cofactor 
to  modify  the  N-terrainal  glycine  of  substrate  proteins, 
also  shows  homology  withta  the  core  domain  of  PCAF 
(r.m.s.  between  Ca  atoms  of  0,93  A),  despite  me  fact  that 
NMT  shows  no  sequence  homology  with  the  GNAT 
.supcrfamily  of  iV-acetyltransferases  (Modis  and  Wierenga, 
1998).  Surprisingly,  motif  B  (Figure  IB)  of  the  GNAT 
superfaniily,  which  shows  sequence  homology  comparable 
with  chat  of  motifs  A  and  D  (NeuwaJd  and  Landsman, 
1997).  shows  no  stractural  homology  between  the  PCAF, 
HATl  and  SmAAT  proicirw  (Figure  5A).  Instead,  there  is 
a  small  region  of  structural  homology  between  these 
proteins  just  C-cerminal  u>  motif  A  which  forms  a  short 
tum-strand-lum  substructure  which  we  call  motif  B* 
(Figure  5C). 

Superposition  of  the  core  domain  of  PCAF  with  the 
corresponding  regions  of  HATl  and  SmAAT  shows  an 
excellent  superposition  of  the  pantetheine  arm  and  pyro¬ 
phosphate  ^oups  of  coenzyme  A,  while  the  ribose 


sugar  and  adenine  base  adopt  different  confonnati<»s 
(Figure  5B).  Significantly,  the  majority  of  the  interactions 
between  the  A  motif  of  the  structurally  conserved  core 
and  the  pantetheine  arm  and  pyrophosphate  group  of 
coenzyme  A  arc  conserved  between  the  three  proteins 
(Figure  5C).  Specifically,  a  scretch  of  seven  residues  in  a 
loop-helix  region  (residues  581-587  in  PCAF)  make  a 
conserved  set  of  backbone  contacts  to  the  p>wphc5phaie 
group  of  coenz>'me  A.  Significantly,  these  residues  harbor 
the  conserved  and  mutationally  semitive  QTlxxGxG/A 
motif  found  in  a  large  number  of  coenzyme  A-bindinp; 
proteins  (NtuwalJ  AiiU~1:4tjil5man.  1997).  and  shown  in^ 
this  and  other  studies  to  be  an  important  stractural  com¬ 
ponent  for  coenzyme  A  binding  (Dutnall  tt  al,,  1998; 
Wolf  et  oL,  1998).  In  addition,  a  three  amino  acid  stretch 
of  residues  at  the  tip  of  the  P-strand  of  motif  A  (residues 
574-576  in  PCAF)  and  the  first  residue  of  the  Q^FxxGxG/ 

A  motif  (residue  580  in  PC.AF)  also  make  conserved  van 
dcr  Waab  and  hydrogen  bond  icteractions  throughout  the 
pantetheine  arm  of  the  coenzyme  A. 

Residues  just  C-teiminal  to  die  structurally  conserv’ed 
B'  motif  also  make  coenzyme  A  contacts  in  all  four 
protein  structures  (PCAF.  HATl,  SmAAT  and  NMT); 
however,  there  is  no  pattern  of  conservation  between  these 
contacts.  The  importance  of  these  residues  in  the  RAT 
activity  of  PCAF  is  suggested  by  their  strict  conservation 
within  the  GCN5/PCAF  subfa^y  and  by  their  high 
degree  of  mutational  sensitivity  (Figure  IB).  Interestingly, 
these  residues  arc  located  in  a  region  overlapping  the 
putative  substrate^binding  site  of  PCAF.  Taken  together, 
these  observations  suggest  that  the  protein  regions  just 
C-tenninal  to  motif  A  of  the  core  may  play  an  in^ortanl 
general  role  in  correctly  orienting  acetyl-coenzyme  A  for 
substram-spcdfic  catalysis  and/or  play  a  direct  role  in 
substrate-specific  recognition.  Cemsistent  with  tiiis  hypo¬ 
thesis,  PCAF  and  HATL,  which  acetylate  protein  sub¬ 
strates,  show  an  extension  of  the  homology  within  the 
motif  B'  regions  to  an  additional  helical  segment  [a4  in 
PCAF,  and  a9  in  HATl  (Dumall  et  al,  I99S)].  In 
contrast,  SmAAT.  which  catalyzes  the  acetylation  of  an 
aminoglycoside  substrate,  contains  a  p-strand  in  the  corres¬ 
ponding  position  (Wolf  et  al.,  1998). 

Ibken  together,  the  degree  of  structural  conservation 
within  the  A  and  D  motifs  of  the  GNAT  proteins  (Neuwald 
and  Landsman,  1997)  PCaF,  RATI  (Dutnall  et  aL  1998) 
and  SmAAT  (Wolf  etal,  1998),  as  well  as  the  conservation 
of  coenzyme  A  contacts  within  these  proteins,  suggests 
that  ocher  GNAT  family  raembeT?  will  share  hcrnologous 
structural  and  functional  cceazyme  A-bind:ng  properties. 
The  fact  that  this  structural  homology  also  extends  to  the 
unrelated  NMT  protein  (Bhatnngar  et  al,  1998;  Weston 
et  aiy  1998)  suggests  that  the  core  domain  of  PCAF  may 
form  a  structural  paradigm  that  extends  beyond  just 
the  acetytransferase  proteins  that  constitute  the  GNAT 
superfamily  (Neuwald  and  Landsman,  1997), 
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Implfcations  for  substrato  binding  and  catalysis  by 
other  N-acetyltransferases 

Regions  N-  and  C-tcnninal  to  the  PCAF  core  domain 
show  no  sequence  homology  witii  OLher  acetyl  transferase 
enzymes.  Interestingly,  however,  the  N-terminal  segment 
of  PCAF  shows  structural  hgmology  with  the  HATl, 
SmAAT  and  NMT  protems  (Figure  5 A).  The  structural 
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Fig.  5.  Comparison  with  ofihcT  acctyhransfcrafie  enzyfnei.  (A)  Supcrpositicn  of  ttK  PCAF,  HATi  (DutnaJl  oA,  1998)  and  S/nAAT  (Wolf  et  aL, 
1998)  prowins.  The  color  coding  for  PCAF,  HATl  and  SmAAT  is  red.  Hue  and  aqua,  respectively,  The  NMT  (Bhatnagar  aL,  1998;  Weston  c/  aL, 

1 998)  protein  shows  a  comptnble  supetposidon  but  waa  ocniited  Ibr  clarity.  Only  the  coensryme  A  cofacter  from  PCAF  is  shown  in  yellow  for 
clar'ty.  (B)  Superposition  of  the  core  domain  and  coenryme  A-binding  site  for  PCAF,  HATl  and  SmAAT.  In  the  superposition,  the  cote  domain 
(mocift  O,  A  and  is  supcrlmpoxd^Coengymc  A  is  ihawn  in  the  color  of  the  protein  with  which  it  is  associawd.  (C)  Sequence  and  iecondaiy 
''TtniSure  alignment  of  PCAF,  HATl  and  SmAAT  The  •  i>Tnbol  indicatet  residues  that  pUy  conserved  roies  in  protein  ytabiliiy.  and  the  box 
symbols  indicate  residue!  that  pby  conserved  roles  in  coenxyme  A  binding;  the  Q  symbol  indicates  baclcbotie  imemetions  and  the  gl 

indicaws  side  chain  uttcraciions  with  coen^yntc  A.  Secondary  structural  clernems  of  the  reipective  ptoieins  are  shown  above  the  sequence  alignment. 
The  alignment  of  the  B  motif  of  HATl  was  based  cn  the  stiuciural  alignment  with  PCAF  wd  differs  from  tne  sequence  alignment  described  by 
Neuwald  and  Landimin  (1997).  We  have  called  this  modified  version  of  motif  B,  iiudally  described  by  Ncuwald  and  Landsman,  motif  B'. 
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homology  between  these  proteins  is  focmed  by  a  ^itrand- 
tum-Ct-helix-hira,  in  which  the  P-strand  forms  conserved 
sheet  interactions  with  the  core  domain  and  the  a-helix- 
mm  region  sits  above  the  protein  core  of  PCAF.  Regions 
C-terminal  to  the  core  domain  of  PCAF  show  no  structural 
homology  with  HATl,  Sm^AT  or  NMT  (Figure  3A). 
This  obscr\'ation»  coupled  with  the  apparent  functional 
importance  of  the  N  and  C-terminal  segments  of  PCAF 
for  substrate  binding  specificity,  suggests  that  die  corres¬ 
ponding  regions  of  other  members  of  Che  GNAT  super- 
family  also  play  an  important  role  in  substrate  binding 
specificity.  This  hypothesis  is  consistent  with  a  general 
model  in  which  substrate  binds  over  the  structurally 
conserved  core  domain  in  close  juxtaposition  to  the  acetyl- 
coenzyme  A  oofactor.  This  binding  is  mediated  by 
N-  and  C-tenninal  protein  segments  which  contribute 
specificity  detemunants  for  substrate;  the  N-ternimal 
portion  contributes  a  homologous  structural  scaffold  con¬ 
taining  substrate-specific  side  chain  dstcmiinants,  whereas 
the  C-tcrminal  segment  also  contributes  to  substrate  bind¬ 
ing  through  structure-specific  components.  As  described 
in  the  preceding  secdon,  the  structurally  divergent  motif 
B  (Neuwald  and  Landsman,  1997)  of  the  core  domain 
may  also  play  a  role  in  substrate-specific  binding  by  tlie 
GNAT  superfamily  of  Ar-acctyltransferascs. 

The  identification  of  the  general  base  within  the  core 
domain  of  PCaP,  coupled  with  the  conservation  of  the 
core  domain  structure  and  the  mode  of  coenzyme  A 
binding  within  the  HATl,  SmAAT  and  NMT  acctyllrans- 
fcrascs,  leads  to  predictions  about  the  mechanism  of 
catalysis  for  these  other  acetylcransferascs.  Foremost,  it 
seems  likely  that  like  PCAF,  these  other  acctyltransferascs 
carry  out  c^lysis  through  a  ternary  complex  mechanisuL 
This  is  supported  further  by  the  absence  of  conserved 
residues  within  the  active  sites  of  the  HAll,  SmAAT  and 
NMT  enzymes  that  may  play  a  role  as  a  nucleophile  in  a 
proposed  ping-pong  mechanism.  Interestingly,  a  structural 
superposition  of  the  core  domain  of  PCAF  with  the 
respective  core  domains  of  HATl,  SmAAT  and  NMT 
reveals  the  presence  of  acidic  residues  that  superimpose 
closely  with  Glu570  of  PC.AF  and  that  are  thus  implicated 
as  playing  a  catalytic  role.  Specifically,  superposition  of 
the  core  domains  of  PCAF  and  SmAAT  shows  that  Asp  LIO 
of  SmAAT,  locatad  on  an  anoJogouD  ^  stfODd  of  SmAi*T, 


'Ts  In  posjrion  to  act  as  a  general  base  for  catalysis 
(Figure  5B).  This  is  consistent  with  the  proposed  catalytic 
role  of  this  residue  by  Burley  and  co-workcrs  (Wolf  etal, 
1998),  and  witli  the  assumption  that  the  bound  spermine 
molecule  in  the  SmAAT  structure  mimics  the  position  that 
would  be  occupied  by  the  aminoglycoside  substrate  (Wolf 
etaU  1998).  Superposition  of  PCAF  with  the  core  domain 
of  HAT  I  reveals  that  the  Glu255  of  HATl,  emanating 
from  a  ^-strand  just  C-tennioaJ  to  tlie  conserved  A 
motif,  maps  closely  to  ihc  position  of  Glu570  of  PCAF 
(Figure  5B),  Interestingly,  this  glutamate  residue  in  HATl 
forms  an  insertion  sice  relative  to  the  homologous  position 
of  PCaF  and  SmAAT  widiin  the  structurally  conserved 
motif  B'  (Figure  5C).  Consistent  with  the  importance  of 
Glu255  in  catalysis  by  HATl  is  its  suict  conservation 
across  different  species  of  HATl  (Dumail  et  a^.  1998). 
Interestingly,  a  superposition  of  the  core  domain  of  PCAF 
with  the  respective  core  domain  of  NMT  Reveals  that 
Glul67  of  NMT  is  in  an  almost  idemicall  position  to 


Glu57dlof  PCAF.  Although  this  does  suggest  a  catalytic 
role  for^ul73  in  NMT,  recent  mutational  and  structural 
studies  indicaxe  that  the  C-lcrrainal  carboxylatc  group  of 
NMT  (which  is  in  approximately  the  same  region)  plays 
a  more  important  catalytic  role  (Rudnick  et  al.,  1992; 
Bhamagar  et  1998;  Weston  et  ai,  1998). 

Conclusion 

The  smicTure  of  the  PCAF-coenzyrac  A  complex  has 
revealed  an  enzyme  primed  for  substrate  binding  and 
catalysis.  Coupled  with  the  extensive  mutational  data  on 
PCAF  (Manincz-Balbas  <?f  a/.,  1998)  and  the  highly  related 
yeast  <>CN5  enzyme  (Kuo  et  aL  1 998;  Wang  et  aL,  1998), 
the  PCAF  HAT  domain  structure  affords  the  details  of 
cofactor  binding  and  has  implications  for  the  mechanism 
of  substrate  biding  and  catalysis.  Comparison  with  the 
structures  of  HATl,  SmAAT  and  NMT  implies  that 
other  AT-acetyltransferases,  such  as  those  that  function 
to  acctylatc  histone  and  Cranscriptiou  factor  substrates 
including  ES Al ,  TAFii250  and  CBP,  may  have  similar 
structural  and  functional  properties.  Further  insights  will 
undoubtedly  be  provided  by  the  structure  of  other  HAT 
enzymes,  appropriate  ternary  enzyme  complexes  with 
coenzyme  A  and  substrate,  and  detailed  biochemical 
analysis  of  substrate  binding  and  enzyme  catal^^is. 
Nonetheless,  the  structure  presented  here  forms  a  paradigm 
for  substrate-specific  binding  and  catal^ncic  mechanism, 
not  only  for  the  GCNS^TCAF  subfamily  of  histone  acetyl- 
transferases,  but  also  for  other  N-acetyltransferases  that 
function  to  acctylste  histones,  transcription  factors,  or 
other  protein  or  small  molecule  substrates. 


Materials  and  methods 

expression  end  puriftcation  of  the  recomhinsnt  PCAF  HAT 
domsin 

The  DNA  sequence  encoding  residues  493-45S  (including  aniY-terminal 
Met-Lys  sequence)  of  PCAF  was  amplified  by  PCR  and  subcloned  into 
the  pRS£T-A  vector  (Invilrogen)  for  ovcicxprcsvcn.  The  pliumid  was 
tmiufonncd  into  E,cofi  Stain  BL21(DE3)  and  ovcrcxprcsscd  by  induction 
with  0.5  mM  isopropyJ-P-D-thiOgUlactopyranosidc  CTFTG)  and  grown  at 
i5*C  for  12  h.  Foilowinp  jonicAtion,  the  protein,  which  ww  contained 
predominantly  within  the  soluble  fraction,  wiw  purified  with  sequential 
luo  of  SP-Sephaio$c  (Pharmacia)  cation-cxchangc,  coenzyme  A.*agarose 
(Sigma)  and  Supeidex  75  (Pharmacia)  gel  filtration  chromatographies. 
Gcl  filsation  revealed  that  the  PCAF  HAT  domain  was  monomerio  in 
solution.  Purified  protein,  which  was  judged  to  be  >99%  pure  b>'  SDS- 
PAC£.  was  conccntratod  to  -2CM0  ittg/ml*  flash  froscn,  and  stored  at  - 
70*^  in  a  buffer  contaiiung  20  mM  Na-citatc  pH  6.0.  130  mM  NaCl, 
10  mM  B-mercaptoethanol. 

CrystofilzaUon  end  dsta  eoffectfon 

Crystals  of  the  PCAF-cocnzyme  A  complex  were  obtained  at  20*C 
using  the  vapor  diffusion  hanging  drop  method.  i\n  aliquot  (3-6  pi)  of 
a  pratein-cofactor  mix,  contaim.Tg  10  mg^ml  of  protein  with  a  2’fold 
molar  e.>ccess  of  cofactor,  was  mixed  with  an  equal  volume  of  reservoir 
solution  containing  1.3-1 .6  M  and  O.l  M  Tris-KCl  pH  8.S. 

Although  Na*acctyl-coenzyiiic  A  used  as  Lhe  i;ofiictOf  in  the 
cxysullizatsonv  only  coenzyme  A  was  modeled  in  the  final  *rtnjcturTc 
(see  discussion  below).  Equilibraiion  of  the  crystallization  crop  against 
1  ml  of  rescrv’Oir  soluhon  produced  rod-shaped  cry  xtaJs  within  2-3  weeks 
with  average  cell  dimer.sio.ns  of  0.2X0.08X0.08  mm.  Crystals  were 
IransferreJ  lequcnlklly  into  a  cryoprotcctant  soludan  centaining  1.5  M 
0.1  M  Tris-I'ia  pH  8.5  and  .‘5%  ethanol  prior  to  fla.vh  hewing 
them  in  liquid  propane  for  dau  collection.  Diffraction  data  was  collected 
on  beamlinc  X4-A  (X  -  1.0009  a)  31  the  Nauonil  $ync.hrotron  Light 
Source  at  Broofchaven  National  Laboratoiy  from  a  sirg'.e  ciysuU  at 
180®C  using  a  Ritxis  TV  image  plate  detector.  The  data  were  pweessed 
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Ciyttal  structure  of  tha  PCAF-«oenzyme  A  compftx 


«uk1  sealed  using  DEN20  and  SCALEPACK  (CXwinowski,  1993) 
(TaUe  I). 

Structum  dMmwntion  and  refinement 
Jht  structDi#  of  ths  PCAF-cociaymc  A  complex  WM  Wived  tjy 
molecular  rtplacemcut  using  the  program  AMORE  1994),  wiih 

a  paitially  refined  model  of  residues  49-198  of  the  Tetrahymena 
thtTmopMlUa  OCNS  (IOCN5)  HAT  dwiialn  (J.KLRojas,  R.C.Trievel, 
YJdo,  XU,  J.5aiou,  S-L-Bcrgct;  CD.A1US  luid  R.Maimcntcin,  submit¬ 
ted}.  Rotational  and  translitiofMa  searches  yielded  two  solutions  Aat 
woe  related  by  non-crysuilographic  symwetty  (NCS)  with  an  cstimaad 
solvent  conicin  of  Prior  to  nefincnient,  a  randomly  generated  10% 
of  the  lenections  was  designated  as  an  Rit*  set  to  monltcc  the  progtess 
of  the  miinesnent.  Following  rigid  body  fcfinenicnt  from  10  to  3.0  A 
resolution  with  the  program  X-PLOR  (Brtngcr,  1992),  the  initial  eleccron 
density  maps  generated  with  oA-wdghicd  Fourier  coefficients  liTol  - 
IR^l  and  ‘F’ol  -  1/?^!  showed  clear  aide  chain  density  for  maat  of  the 
PCAF'Spccific  residues  that  woe  orahted  Cor  the  moleadar  replacement. 
These  residues  were  built  into  election  density  using  the 

progjam  O  (Jones  cr  aL,  1991),  producing  a  model  that  contained 
residual  498-646  of  PCAR  Ailcc  One  round  of  positional  refinement 
and  simulated  annealing  (Rrungcr  and  (GrukowSld.  1990)  usi^  Strict 
NCS  coiistraijils  from  10.0  to  3,0A.  election  density  maps 

showed  itTODg  pcaJo  for  the  pantothenic  acid  and  the  pyrophosphates 
of  the  3'’pbOSphttc  ADP  moiety  In  coenzyme  A  ia  addition  to  several 
additional  C^ierminal  protein  residues.  After  including  the  coenayme  A 
and  C-:termiAd  protem  residues  m  the  model  whh  0..  rafinemeni 
proceed  by  multiple  rounds  of  positional  refinement,  simulated 
annealing  (Brunger  and  Ktukowsid,  1990)  and  tenion  angle  dynamics 
(Rice  and  Brunger,  1994)  with  periodic  model  building  iu  0.  Refinement 
was  extended  in  xe&olution  steps  of  2.7, 2.5  and  2J  A  using  the  prognuns 
X-PLOR  and  CNS-SOLVE  (Biunger  e/  at.  1998).  As  the  resolution  w« 
extended,  the  NCS  restrainta  Mfcrc  gradually  removed.  During  model 
building,  the  model  was  adjusted  pezsodicully  CO  simulaied  annealed 
emit  wapi  (BrUngcr  ef  a/.,  1987)  that  were  generated  over  the  entire 
structure  by  omittiDg  5-10  residues  at  a  time.  At  the  final  stages  cf 
mfinemvnt,  a  bulk  solvent  correction^  (Jiang  and  Utuogcr.  1994)  was 
applied  using  dan  from  20.0  to  2.3  A,  and  tightly  oonstrained  atomic 
B-factor  wen?  irfmed  with  CNS-SOLVE.  Water  molecules  were  built 
into  strong  tF^ji  -  IBci  p«lo  and  only  retained  if  possible  hydrogen 
bond  psutnsrs  could  be  located  and  if  they  refined  to  reasonable 
atOTTuC  B-factort. 

The  fiTjal  itnietnre  contains  residues\493-633  (and  an  AT-tcnnlnal 
lysine)  of  complex  A  and  msldues  493-613  (and  an  iV>{erminal  lysine) 
of  complex  B  in  the  aiyrnmctric  unit  cell.  Complex  A,  whkh  makes 
more  ccystal  lattice  contacts  than  complex  B,  is  better  ordered,  with  an 
ivenigc  atomic  B-faclor  of  31 .8  A^.  Residues  in  complex  B  have  an 
average  utornk  B-faClor  of  41.1 .1-,  and  the  side  chains  of  naiducs  503, 
505,  625,  626,  627,  631  and  636  weic  modeled  as  alaninc.s  since  side 
chain  deiuity  was  not  observed  for  these  residues  ‘.n  the  final  electron 
density  map.  Each  ptntein  in  the  asymmetric  unit  is  bound  to  ono 
molecule  of  coenzyme  A.  Although  acetyl-coeozyme  A  was  included 
diifing  crystallization,  neither  complex  shows  strong  density  for  the 
acetyl  group  or  the  thioestex  bond  of  acetyl-cocnrymc  A,  suggesting 
that  the  acc^l  group  was  hydrolyzed  in  sohitioQ  or  that  file  acetyl  group 
is  highly  flexible  and  disordered.  The  final  structura  hni  an  Rf**  of 
25,8%  and  on  of  22  with  exceilant  geometry  (Table  I)  and 

none  of  the  non-glydnc  rcxiducs  lying  in  disallowed  regions  of  the 
Ramactondran  plot  (Kleywegt  and  Joixs,  i996b).  ^ 
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ABSTRACT 

The  p300/CBP  associated  factor  (P/CAF)  is  a  transcriptional  coactivator  that  was 
originally  identified  through  its  ability  to  interact  with  p300/CBP,  and  to  counteract  the 
ability  of  the  adenoviral  El  A  oncoprotein  to  inhibit  p300/CBP-mediated  transcriptional 
activation.  Specifically,  P/CAF  contains  histone  acetyltransferase  activity  that  is 
associated  with  transcriptional  enhancement  and  myogenesis.  In  addition  to  nucleosomal 
histone  H3  acetylation,  P/CAF  enhances  transcription  by  acetylating  transcriptional 
activators  such  as  p53.  In  order  to  gain  insight  into  P/CAF-mediated  transcriptional 
activation,  we  have  determined  the  2.3A  crystal  structure  of  the  histone  acetyltransferase 
domain  of  P/CAF  bound  to  coenzyme  A.  The  structure  reveals  an  o/p  globular  fold  that 
contains  a  central  protein  core  which  sits  at  the  base  of  a  pronounced  cleft  that  is  formed 
by  the  N-  and  C-terminal  protein  segments.  The  protein  core  at  the  base  of  this  cleft 
makes  extensive  contacts  with  the  pantetheine  arm  of  coenzyme  A,  marking  the  active 
site  of  this  histone  acetyltransferase  domain.  Additionally,  correlation  of  this  structure 
with  extensive  mutagenesis  data  for  P/CAF  and  the  homologous  yeast  GCN5  protein 
implicates  this  cleft  and  the  N-  and  C-terminal  segments  to  play  an  important  role  in 
histone  binding.  Inspection  of  this  histone  binding  region  suggests  that  a  glutamate 
residue  within  the  protein  core  plays  a  catalytic  role  for  histone  acetylation.  A  structural 
comparison  with  the  coenzyme  A  bound  forms  of  the  related  N-acetylransferases, 

HAT  1  (yeast  histone  acetyltransferase  1)  and  SmAAT  {Serratia  marcescens 
aminoglycoside  3  N-acetyltransferase  ),  implicates  the  mode  of  substrate  binding  and 
catalysis  by  these  enzymes  and  establishes  a  paradigm  for  understanding  the  structure- 
function  relationships  of  other  enzymes  that  acetylate  histones  and  transcriptional 
regulators  to  promote  activated  transcription. 


